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Abstract. The truncated variation, TV C , is a fairly new concept introduced in [5]- Roughly 
speaking, given a cadlag function /, its truncated variation is "the total variation which 
does not pay attention to small changes of /, below some threshold c > 0". The very 
basic consequence of such approach is that contrary to the total variation, TV C is always 
finite. This is appealing to the stochastic analysis where so-far large classes of processes, 
like semimartingales or diffusions, could not be studied with the total variation. Recently 
in [6], another characterization of TV C was found. Namely TV C is the smallest possible 
total variation of a function which approximates / uniformly with accuracy c/2. Due to 
these properties we envisage that TV C might be a useful concept both in the theory and 
applications of stochastic processes. 

For this reason we decided to determine some properties of TV C for some well-known 
processes. In course of our research we discover intimate connections with already known 
concepts of the stochastic processes theory. 

Firstly, for semimartingales we proved that TV C is of order c _1 and the normalized trun- 
cated variation converges almost surely to the quadratic variation of the semimartingale as 
c \ 0. Secondly, we studied the rate of this convergence. As this task was much more 
demanding we narrowed to the class of diffusions (with some mild additional assumptions). 
We obtained the weak convergence to a so-called Ocone martingale. These results can be 
viewed as some kind of law of large numbers and the corresponding central limit theorem. 

Finally, for a Brownian motion with a drift we proved the behavior of TV C on intervals 
going to infinity. Again, we obtained a LLN and CLT, though in this case they have a 
different interpretation and were easier to prove. 

All the results above were obtained in a functional setting, viz. we worked with processes 
describing the growth of the truncated variation in time. Moreover, in the same respect we 
also treated two closely related quantities - the so-called upward truncated variation and 
downward truncated variation. 
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1. Introduction and results 



Recently, the following notion of the truncated variation has been introduced in [5]: 

71-1 

(1.1) TV c (/,[a;6]) := sup sup £ 4> c (\f(t i+1 ) - f(U)\) , 

n a<ti<t2<...<t„<b i=1 

where <j) c {x) = max {3; — c, 0} , c > and / : [a; b] 1— > R is a cadlag function. The trivial 
observation is that TV is nothing else that the total variation (which will be also denoted 
by TV). The introduction of the truncation parameter c makes it possible to circumvent 
a classical problem of stochastic analysis; namely, that the total variation of the Brownian 
motion as well as of a 'non-trivial' diffusion process is almost surely infinite. This alone makes 
TV C an interesting research object. Other properties of TV C were found, amongst which the 
variational characterization of the truncated variation given by 

(1.2) TV C (/,M])= inf {tV (</>;&]) : 5 such that \\g - < 1 C J , 

where WgW^ := sup{|<?(:r)| : x E [a; b]}. In other words, truncated variation is the lower bound 
for the total variation of functions approximating / with accuracy c. It appears that the inf in 
the above expression is attained at some function g c . The properties just listed give hope that 
TV C could be used in the stochastic analysis. This question is a active field of research, some 
promising results are contained in [7], like definition of a stochastic integral with respect to a 
semimartingale as a limit of the pathwise Riemann-Stieltjes stochastic integrals, and other are 
being investigated. A detailed description would be too vast for our introduction therefore we 
refer the reader to [7] and [6] , and its debriefing in Section [2j 

Having agreed that TV C might be a useful tool, an important task is to describe the behavior 
of TV C for a vast class of stochastic processes. This is the main aim of this paper. We will 
derive first order properties for continuous semimartingales and second order properties for 
continuous diffusions (under some mild technical assumptions) when c\0. Intuitively, these 
answer the question of how fast TV C converges to the total variation, that is how fast it diverges 
to infinity. In the case of the Brownian motion with drift we will also study the behavior of 
TV C on large time intervals. 

Before presenting our results we define two concepts closely related to TV C . The upward 
truncated variation given by 

n 

(1.3) UTV c (/,[a;6]) := sup sup V 4>c (/(*) - /(*<)) , 

71 a<t 1 <si<t2<S2<...<t n <s n <b i=1 

and the downward truncated variation given by 

n 

DTV c (/,[a;6]) := sup sup £ C - /(*,)) • 

71 a<t 1 <s 1 <t2<S2<--.<t„<s n <b i=1 



The relation between TV C , UTV C , DTV C will become clear in Section \2. II Given a ci^cedli'^ceg 
process {Xt} t>0 we define the following families of processes {TV C (X, t)} t>0 , {XJTV C (X, t)} t>0 
and {DTV c (X,t)} t > by 

TV C (X, t) := TY C (X, [0; t]) , \JTY C (X, t) := XJTV C (X, [0; *]) , DTV C (X, t) := BTY C (X, [0; t]) , 

where all the above definitions are understood in a pathwise fashion. Obviously, all three pro- 
cesses are increasing. Moreover, for semimartingales and c\0, under weak non-degeneracy 
conditions, their values diverge up to infinity. Thus a natural question arises what the growth 
rate of the (upward, downward) truncated variation is. Under a proper normalization we 
expect also some convergence to a non-trivial object. These questions are answered in the 
following section. 

1.1. Behavior as c \ 0. First order properties for continuous semimartingales. For 

a continuous semimartingale {X} te r . T i we will denote its decomposition by 

X t := X + M t + A t , iG[0;T], 

where M is a continuous local martingale such that Mq = and A is a continuous finite 
variation process such that Aq = 0. Given T > 0, by C([0;T],M) we denote the usual space of 
continuous functions on [0;T] endowed with the topology given by norm || • H^. 

Theorem 1. Let T > and let {X} t£ ^ . T ^ be a continuous semimartingale as above. We have 

lime TV°(X,t) -»• {X) t , a.s. 
c\0 

lim c UTV°(X, t) -> {X) t /2, a.s. 

c\0 

and 

lim c DTV^iX, t) -»• (X) t /2, a.s. 

c\0 

In all cases the converge is understood in the C([0;T],R) topology. 

Remark 2. One can see that TV C is of order c . Hence by the discussion above this is also 
the lower bound of the total variation of the approximation of X in || ■ H^-ball of radius c/2. 
For diffusions we will find finer estimates in the next section. 

Assumptions of Theorem Q] could be weakened slightly. Without additional effort we can 
prove the theorem for A not being necessary continuous. This is however cumbersome from no- 
tational point of view, as we cannot work in C([0; T], R) space. The problem of non-continuous 
semimartingales will be treated in full extent in future papers. 

Remark 3. Theorem Q] could be considered as some kind of a law of large numbers. We will 
now provide a rough justification using the Wiener process W as an example. One can imagine 
splitting an interval [0; 1] into c -2 parts. On each part W performs a motion of order c. The 
contribution of the part to the total truncated variations is not negligible and is of order c. 
The contributions are random and "almost" independent for non- neighboring parts. Therefore 
there is no randomness in the limit. 



Remark 4. The heuristics presented in the previous remark is nice at the intuitive level however 
a more precise description is required to perform the proof. In the case of a Wiener process 
with drift this will be a precise characterization of ti,t2, ■ ■ ■ for which the sup in definition 
(jl.ip is attained, which will lead to a natural renewal structure . In the case of a general 
semimartingale following the same path seems to be hopeless. To circumvent the problem 
we employed an abstract approach based of time change techniques in spirit of the Dambis, 
Dubins-Schwarz theorem Chapt. V, Theorem 1.6]. 

Having explained "the law of large numbers nature" of the above result a natural question 
arises about the corresponding central limit theorem. This will be addressed in the next section 
for {X t } t>0 being a diffusion satisfying some mild conditions. 

1.2. Behavior as c \ 0. Second order properties for diffusions. Let us now consider a 
general diffusion defined with equation 

(1.4) dX t = a(X t )dW t + fi(X t )dt, X = 0, 

We will always assume that a, fi are Lipschitz functions and a > 0. It is well known, |11| Sect. 
IX. 2], that under these conditions the equation admits a unique strong solution. The main 
result of this section is 

Theorem 5. Let T > then 



;i.s) 



1 f(X) t , \ n lf(X) t \ ^. v . {X) t 



X, UTV(X, t)-- + X t ) , DTV°(X, t)-- -X t ), TV(X, t) 

(X,M t ,Mt,2M t ), asc\0, 

where M is given by the change time formula: 

(1.6) M t :=l2~ l ' 2 B {x)t , 

where B is a standard Brownian motion such that B and X are independent. The convergence 
is understood as the weak convergence in C([0;T],M) 4 topology. 

Remark 6. Let us notice that by O Proposition 5.33] from the joint convergence of X and 
three other processes related to UTV, DTV and TV one obtains their stable convergence as 
described in [3l Sect. VIII. 5]. 

Remark 7. Let us now present an intuitive explanation of the result on the example of a 
Wiener process with drift, W and the truncated variation. Theorem [1] reads as 

cTV c (W,t) -> t, a.s. 

and by Theorem [S] and the fact that {W)t = t we obtain 

TY c (X,t)-- ^ d T ll2 B t . 
c 



In this case the theorems are indeed an "almost classical" law of large numbers and central 
limit theorem. This stems from the fact that TV C in this case has a particularly nice, renewal 
structure. 

On the intuitive level, by equation (jl.2p one may say that for any path of W on interval 
[a; b], minimal "vertical" length of graph of any random function / : [a; b] — > R, uniformly close 
to this path must be at least equal to 

b — a lb — a 

where c = 2 sup t£ ^ a .^ \f (t) — Wt \ , and R c is a random variable such that it tends in distribution 
to a standard normal distribution N (0, 1) as c \ 0. Note that for small c's this lower bound 
is almost deterministic. 

Remark 8. It is easy to check that (M) = (X). Let M be the local martingale in the 
semimartingale decomposition of X. It is natural to ask how the laws of M and M are 
related. The martingale of the form given by equation (11 .61) were introduced in [9] and are 
called Ocone martingales. By results of [15] it follows that M is an Ocone martingale only if 
a = const (i.e. X is a Brownian motion with some stochastic drift). 

Let us also notice that a = const is also the only case when {X)t is a deterministic process. 

Ocone martingales have particularly simple structure which sometimes makes it easy to 
draw conclusion about them. As an example we consider a situation when a < C. Then 

P f sup M t > a J < P f sup 12-^ 2 B t > a ) = P ( sup B t > (12/C) 1/2 a j , 

\t6[0;T] J \t£[0;CT] J \t£[0;T] J 

hence M has a Gaussian concentration. Further properties and references can be found in 

m 

Remark 9. The assumption a > is equivalent to a ^ 0. This follows by the fact that a is 
continuous so, under the assumption that a ^ 0, either a > for any x or a < 0. In the latter 
case one can simply take —a instead of a and obtain a diffusion with the same law. 
The case when a may attain value requires further studies. To see this let us consider "a 
very degenerate case" when a = on an interval [xq; x\] for xq <x\. For any x G (xq; Xi) the 
diffusion degenerates locally to a deterministic process, a solution of an ordinary differential 
equation, with a bounded total variation. Hence the above formulation of the CLT does not 
make sense. While this case was relatively easy, the situation becomes more involved for 
border points xq,x\ or "isolated" 0's. We suspect that in such cases a non-trivial correction 
term containing the local time may be required. 

Remark 10. Similarly as in the case of the law of large numbers (see Remark |4|) the proof 
splits into technically different parts. 

The first one deals with the Wiener process with drift Xf = Wt + [it. We use here the fact 
that TV C (X, t) has a fairly simple renewal-like structure. Moreover, it is possible to derive 
explicit formulas for the Laplace transform of the increments of the truncated variation. Then 
a very simple argument allows to treat random drift, i.e. the case where a is a random variable 
independent of W. 



The second step deals with diffusions with a = const. Namely, on a small interval we have 
X/\t+t — X-t « a{W/\t+t — Wt) + fi(Xt)At := Ya* which is essentially a Wiener process with a 
random drift as above. It turns out that we may control the quality of the approximation to 
conclude the proof using some metric-theoretic tricks and the Prohorov metric in this case. 

As explained in Remark HHH this approach fails in the case of non-constant a. Here we appeal 
to a time change technique and a Renyi mixing-like argument (see e.g. \13\ p. 309]. A reader 
familiar with this kind of reasoning may recognize that this is why we get the independence 
in equation fjl . 6[) . 

1.3. Large time results. For the Wiener process with drift it is possible to derive results for 
large time. In this section, we put 

X:=W t + fit, 

Firstly, we present 

Fact 11. Let T > and c> 0. We have 

lim TV°(X,nt) jn -> m c u t, a.s., 

where the convergence is understood in C([0;T],K) topology and 



mC = Ucoth(c^) ifn / 0, 

^ I C" 1 if Li = 0. 



Analogously we have 



and 



lim UTV c {X,nt) /n -> -n c u t, a.s., 



lim DTV°{X,nt) jn -+ -n°t, a.s. 

n— H-oo 2 
where again the convergence is understood in C([0; T], R) topology and 

^ c = Ucoth(c//) +/x iffJ.^0, 
[c- 1 z//x = 0. 

The quality of the above approximation is studied in 

Theorem 12. Let T > and c > 0. We /iawe 

= >• VBt, asn — >■ +oo, 

where — > d is understood as weak convergence in C([0;T],M) topology; is given by M.l\) and 



K) 



2 = J sinh^) + 1 " U ' 



[1/3 #> = 0. 

Theorem 13. Let T > and c > 0. We /iawe 



UTV(X,nt) - \npit d 



Bf, as n — > +oo, 



and 



DTViX^nt)- Intent d 



B t , as n — > +00, 



where — > d is understood as weak convergence in C([0; T], R) topology; n c u is given by l\1.8\) and 



(Pi) 



( 2cxp(4c^)(sinh(2c^)-2c^) - f , n 
2 = J (cxp(2c M )-l) 3 V M ?= U, 

I 1/3 i//i = 0. 



Remark 14. Fact [TT] could be considered as a kind of law of large numbers. Indeed, TV C 
builds up over time (cf. Subsection 12. ip and because of the homogeneity of X its truncated 
variation can be decomposed into a number of independent increments. These increments are 
also square integrable, therefore Theorem [121 and Theorem [T3l hold. 

The task of proving analogous facts for more general classes of processes seems to be elusive 
at the moment. Firstly, our methods failed in this case, but the reason seems to lie deeper 
than that. It is connected with the fact that the truncated variation depends on the paths 
in a rather complicated way, simplifying only when c \ 0. We suspect that it is possible to 
prove similar results for ergodic Markov processes. This however seem a little unsatisfactory 
as in this case the convergence stems merely from the fact that on distant intervals the process 
itself is nearly independent. 



Remark 15. It is possible for the finite dimensional distributions of the normalized truncated 
variation processes appearing in Theorem [T2] and Theorem 1131 to obtain even stronger results, 
namely the Berry-Esseen-type estimates of the rate of convergence to normal distribution. 
The straightforward way to obtain such estimates is to use the already mentioned cumulative 
structure of the truncated variation processes of a Brownian motion with drift and |12[ Theo- 
rem 8.2]. One can check that the appropriate moments exist (see formula equation (j3. 15|) and 
observe that inter-renewal times in this case have the same distribution as the exit time of 
Brownian motion with drift from a strip. Thus we obtain that the difference between the cdf of 
the multidimensional projection of the limit distribution and the cdf of the finite dimensional 
distributions of the normalized truncated variation processes in Theorem [12] and Theorem [13] 
is of order \og(n)/y/n. We suspect that the results of Theorem [5] can be strengthened in a 
similar way. This will be a subject of further studies. 



Let us now comment on the structure of the paper. In the next section we gather facts 
about the truncated variation and discuss potential application to the theory of stochastic 
processes. Section [3] is devoted to the proof of Theorem [5] In section 0] we present the proof 
of Theorem [1] Finally in Section [5] we sketch the proof of the large time results presented just 
above. 



Acknowledgments. We would like to thank Radoslaw Adamczak for help in the proof of 
Lemma [271 We thank also the anonymous referee for useful comments. 



2. Properties of the truncated variation 



This section is based on results of [6]. For reader's convenience we keep much of the notation 
introduced there. 

Arguably the most interesting property of the TV C was listed in equation (II. 2p . Another 
closely related property is given by 

(2.1) TV C (/, [a; b]) = inf {TY(g, [a;b]) : <?such that \\g - f\\ osc < c,g(a) = f (a)} , 

where 1 1 ft, 1 1 osc := sup{|/i(x) — h(y)\ : x,y G [a; b]}. The infimum in equation (|2.ip attained for 
some g 0,c : [a; b] i— > IR, which is unique. Moreover, we also have the following explicit repre- 
sentation: 

(2.2) g°> c (s) = f(a) + UTV C (/, [a; s\) - DTV C (/, [a; s\) 
and 

(2-3) lb°' c -/lloo<c 

g°' c is also closely related to the solution of the problem stated in equation (|1.2|) . Let us 
put «o := — inf {<7 ' c (s) — f(s) : s S [a; 6]} — ^||g ' c — /|| osc - The function g c for which inf in 
equation (jl.2p is attained is given by 

(2.4) g c (s):=a + g°> c ( S ). 

The problem posed by equation (I2.ip seems a little artificial at first. Its formulation has how- 
ever a substantial advantage over the problem of equation (|1.2p when considered in stochastic 
setting. Namely, when working with stochastic processes the solution given by equation (12. 2|) 
is adaptable to the same filtration as the process itself while the solution obtained in equa- 
tion (|2,4p requires some "knowledge of future". We would like also to mention that condition 
\\g — f\\ osc < c in (|2.1|) implies that the increments of / are uniformly approximated by the 
increments of g°' c with accuracy c. This property might be useful for applications to numerical 
stochastic integration. 

To give the reader some intuition about the functions introduced above we rephrase [6l 
Remark 2.4]: " g c is the most lazy function possible, which changes its value only if it is 
necessary to stay in the tube defined by \\g c — fW^ < c/2". This can be seen on the following 
picture 



FIGURE 2.1. An example of function (in red) and g c (in various colors). 




We hope that we convince the reader that the truncated variation is an interesting research 
object. Moreover, we hope that it will be useful both in the theory of stochastic processes and 
in their applications. The first step towards this goal were undertaken in |8] and [7j e.g. in [8] 
was calculated the Laplace transform of UTV C and DTV C for a Brownian motion with drift 
and in [7] are presented possible applications to the approximation of stochastic processes and 
stochastic integration. 

We plan to report shortly on further findings. 



2.1. Joint structure of TV C , UTV C and DTV C . We will now describe the structure of 
TV C , UTV C and DTV C . The construction is described in more details in [6l Section 2]. Let 
— oo < a < b < +oo and let / : [a; b] — > R be a cadlag function. For c > let us assume that 
(2.5) 

r£/:=inf \s>a: sup / (t) - / (a) > c I < T c D f := inf [s > a : / (s) - inf / (t) > c) . 

[ te[o;a] J I te[a;s] J 

i.e. the first upward jump of function / of size c appears before the first downward jump of the 
same size c or both times are infinite, i.e. there is no upward or downward jump of size c. Note 
that when this condition fails one may simply consider function — /. Now we define sequences 

{ T U, fc)^ ' ( T ^)r=-i ' in followin S wa ^ : T D,~i = a > T U,0 = T&f ^d for k > 0: 
r£ )fc :=inf \ 8 >T& tk : sup f (t) - f (s) > c 

T^ k+1 =ini\ S >T c D . k :f{s)- inf f (t) > c 



T C D,k-vTu,k) ^Kand 



Next let us define two sequences of non-decreasing functions m k 

Tjj k ;Tp k J — > ]R for k > such that Tp k _ l < oo and Tfj k < oo respectively, with the 



formu 



a.s 



m% (s) := inf / (t) , M c k (s) = sup / (i) , 

Similarly, let us define two finite sequences of real numbers {m k } and {M k }, for such fc's that 
T£ fc _ 1 < oo and T^ k < oo by 

(2.6) m fc := m fc {Tu,k~) = r mf /(t), 

(2.7) M c k :=M c k {T^ k -)= sup / (t) . 

The above definitions are simple however may be hard to read without pictures. We hope the 
following will be helpful. Note that we present the same function as in the previous example 




The main results of this section is (cf. [H Theorem 2.3]) 
Theorem 16. For any cadlag function f : [a; b] t— > R such that Tfjf < Tfof we have 



UTV°(f, [a;s]) = DTV c (f, [a;s])=0, 



when s G 



and 



UTV^f, [a; a]): 
DTV{f, [a;s]):= 




-c} + M c k (s) 
c} 



} 

}+M c k 



m 



fe+i 



i/s G 



± D,k^ ± U,k+l 



ifse 

c if s G 



1 U,k^ L D,k j ' 
1 D,k' 1 U,k+l 



Moreover, for any cddldg function f : [a; b) i— >■ M and any s £ [a; b] we have 

(2.8) TV(f, [a; s]) = UTVtf, [a; s]) + DTV c (f, [a; s]) . 

2.2. Basic properties of TV C (/, [a;b]) , UTV C (/, [a;b]) and DTV C (/, [a; 6]). We will now 
list some properties, most of which is used in the paper. These are taken from [6l Section 2.4, 
Section 2.5]. Unless stated otherwise the functions considered below are ci^cedli^oeg 

• For any strictly increasing and continuous function s : K — > M 

(2.9) TV C (/, [a; b]) = TV C (/ o a' 1 , [a (a); s(b)}) , 

the analogous equalities hold for UTV C and DTV C . 

• For any / : [a; b] i— > M and any c > we have 

(2.10) DTV C (/, [a; b]) = UTV C (-/, [a; 6]) . 

• For any s S (a; 6) we have 

(2.11) TV C (/, [a; 6]) > TV C (/, [a; S ]) + TV C (/, [s; t}) , 

and the analogous inequalities hold for UTV C and DTV C . 

• On the other hand, for any s G (a; b) we have 

(2.12) TV C (/, [a; 6]) < TV C (/, [a; a]) + TV C (/, [s; t\) + c, 

and the analogous inequalities hold for UTV C and DTV°. 

• For any /, g : [a; b] — > R and c±, c% > we have 

(2.13) TV ci+ C2(/ + ff) [o . b]) < TV c 1(/) [a . b]) + TV c 2(5> [a . ; 

and the analogous inequalities hold for UTV C and DTV C . Note that in above we admit 
some quantities to be infinite in case c\ = or C2 = 0. In particular 

(2.14) |TV C (/ + g, [a; b]) - TV C (/, [a; b})\ < TV(g, [a; b]) . 

These facts were not proved in [6] . We offer a proof in Fact [T7] below. 

• For any / : [a; b] i— > M mapping 

(0, +oo) 3 c M- TV C (/, [a;b]), 

is convex and decreasing hence continuous. The same holds true for UTV C and DTV C . 
Moreover, though not mentioned in [6], it can be easily upgraded to functional setting. 
E.g. we define functional T : (0; +oo) i— > T> (Skorohod space of cadlag functions) given 
by T{c){t) := TV C (/, [a;t]) is convex and decreasing in a point-wise sense. 

• For any / : [a; b] i— > M we have 

(2.15) lim TV C (/, [a; b}) = TV(/, [a; b}) , 
we recall that the right-hand side might be infinite. 



Fact 17. For any f, g : [a; b] — >■ R and ci, C2 > we have 

(2.16) T^ +C2 (/ + 5 ,[a;6]) < TV*(f, [a; 6]) + T^( 5 , [a; b)) , 

and the analogous inequalities hold for UTV° and DTV°. 

Proof. The inequality for UTV C holds by definition equation (11, 3p and the inequality 

max{/ (s) +g(s)-f(t)-g (t) - c x - c 2 , 0} = max {/ (s) - / (t) - ci + <? (s) - g (i) - c 2 , 0} 

< max{/(s) - f (t) - ci,0} + max{^(s) - g (t) - c 2 ,0} . 

By equation (|2.10p we have similar property for DTV C . Finally, to obtain equation (|2.16p it 
is enough to utilize equation (|2.8p . □ 



3. Proof of Theorem [5] 
The proof structure reflects the outline contained in Remark [TUJ We start with 

3.1. Proof for Wiener process with drift. In our proof we will use an Anscombe-like 
result. It is not much more than a reformulation of |13|. Theorem 4.5.5] to our specific needs. 
From now on we will use "<" to denote the situation when an equality or inequality holds 
with some constant which is irrelevant for calculations. Our setting is as follows. Let us fix 
some T > and 

(A(c),^(c)), i>l, 

be sequences of i.i.d. random vectors indexed by certain parameter c G (0, 1]. We define 
(3.1) M c (t) :=min|z>0:^A(c) >t|, 



c{t) \ ¥7 ( ) 

(3-2) P c (t) := [ Z ^ c ) ] " e^§*' 1 G [0;T] - 



Let us observe that such defined M c , P c are cadlag processes. We will use the following 
assumptions 

(Al): For any c > we have Z?i(c) > a.s. and ELq(c) — > as c \ 0. 
(A2): We denote X^c) := Z^c) - (EZi(c)/E£>i(c))A(c). We have EJQ(c) = 0. We 
assume that there exists a > such that 



EX^c) 2 k 2 



EDi(c) 

(A3): There exists 5 £ (0,2] such that 



o~ , as c \, 0. 



- ^0, as c\0. 
EDi(c) 



(A4): There exists 5 > 0, C > such that for any c G (0; 1] we have 

E|Z>i(c)| 1+i < C(E£>i(c)) 1+6 . 
Before formulation of the fact we define 

(3.3) V := V([0;T],R) := {/ : [0;T] h+ R : / is ciicedKioeg} , 

we equip this space with || • H^-norm. This may seem unusual, as the Skorohod metric (see 
[U Chapter 3]) is a more natural choice for space T>. Let us note however that in all cases we 
will obtain the convergence to a continuous limits. In such case both notions are equivalent 
(see [H Section 18]. 

Fact 18. Let T > and assume that (Al)-(A^) hold. Then 

P c ^ d aB, asc\0, 

(3.4) (EDi(c))M c id, as c \ 0, 

where a 2 is the same as in (A2), id(x) = x, and the convergence is understood as weak 
convergence in T>([0; T], R). 

Proof. We define 

n n 

(3.5) S c (n):=J2Zi(c), V c {n) := A(c), n G N. 

1=1 8=1 

Moreover, let us denote /(c) := jgg^M and we recall that JQ(c) := Zj(c) — /(c)-Dj(c). We 
define a family of auxiliary processes 

(3.6) P^^-^cd^J), t>0, 

where H c {n) := S c {n) - f{c)V c {n) and g(c) := (EDi(c))" 1 . By (Al) g(c) -»• +oo as c \ 0. 

Now the proof follows by |13[ Theorem 4.5.5, p. 290]. The assumptions of |13l Theorem 
4.5.5] consist of seven conditions denoted by 74, <S 4 , 5s, <S 7 , <S 8 , Sg and Jiq. These conditions 
read as: 

• (71): (ft e ,fc,£e,fc) , = 1,2, is a sequence of i.i.d. random vectors that take values in 
[0;+oo) x R;' 

• (5 4 ): n e P (k e & >«)—)■ 7Ti (it) as e — )• for all it > 0, which are points of continuity of 
the limitting function it\ (u) ; 

• (^s): n e EK £i fcl| K fc<u | - >■ c(u) as e — > for some u > 0, which is a point of continuity 

of m (u) ; 

• (5 7 ): n £ P (|Ce,fc| > n ) - ► as e — > for every it > 0; 

• (S s ): n £ E |£ Ejfc | ln^ i<i -)• a as e ->■ for some u > 0; 

• (<S 9 ): n e D 2 |£ £>fe | lr| £e fe |< n | -)• 6 2 as e -)• for some it > 0; 

• (i72o) : c = c (n) — J " sdiTi (s) > 0, where 7Ti (s) and c (it) are obtained in (1S4) and (S5) 
respectively. 



Before verifying assumptions we list how our notation translates to the one of |13| . c is e, |~g(c)] 
is n e , Di(c) is and Aj(c) is £ ej j. Condition 7i (p. 287) is obviously fulfilled. Conditions 
54 and 55 (p. 283) hold with 7r(u) = and c(u) = 1 respectively. Indeed, let us fix u > 0. 1S4 
writes as 

r<7(c)lP(A(c) > u) < Ig^u-^ElD^c)^ 6 < \g(c)]g(c)-^ -+ 0, as c \ 0, 

where we used assumptions (Al), (A4) and the Chebyshev inequality. We will use a few times 
an obvious inequality 

(3.7) \x\ Sl+52 > \u\ 5l \x\ 52 , 
valid for any <5i , <?2 > and \x\ > \u\. We check that 

(3.8) ^(c)lEDi(c)l {JJl(c)>u} < ^(c)l«-*E|I>i(c)| 1+i -+ 0, as c \ 0, 

again by (Al), (A4) and equation (13. 7p . The expression in condition 5s writes in our notation 
as [5(c)]EDi(c)l{£) 1 ( c )< M }. By equation (|3.8p its limit does not depend on u and is the same 
as the one of 

\g{c)~\ED 1 (c) 1, asc\0, 
which follows by (Al) and the definition of g(c). 

We will now verify conditions 57, 5s, Sg (p. 287-288) with a = and b 2 = a 2 . Let u > 0, 
the condition £7 writes as 

^(c)1P(|Xi(c)| >«) < ^(cJlu-MEIXiCc)! 2 ^ -+ 0, asc\0, 

where we used assumption (A3) and the Chebyshev inequality. Further we have 

\g(c)]E (|JTi(c)|l { | Xl(c) | >u} ) < \g(c)]u-^E\Mc)\ 2+S -> 0, as c \ 0, 

where we used assumption (A3) and equation (|3.7p . Now S$ follows directly from above and 
the equality 

E (Ai(c)l { | Xl(c) | >u} ) = -E(A A 1 (c)l { | Xl(c) |< u} ) 
which is a consequence of the fact that EAj(c) = 0. Let us now observe that 

\g(c)]E {X 1 (c) 2 l {lXl{c)l>u} ) < ^(cJlu-'ElX^c)! 2 ^ -> 0, as c \ 0, 

where we again used assumption (A3) and equation (13. 7p . By the above considerations we 
have that lim c \ j0 r5(c)l Var(A A i(c)l{|x 1 ( c )|<n}) i s the same as lim c ^ i0 r5'(c)lVar(Xi(c)). Now Sg 
follows directly from (A2). Finally, J 20 (p.285) holds with c = 1 see also QJ3J (4.5.2)]. 

Now, it is straightforward to identify the limit using the description in |13[ p. 284 and p. 
288]. Indeed, the process kq (p. 284) is simply given by Ko(t) = t (notice that on the right 
hand side of formula (4.5.1) in |13[ p. 284] one should replace z by y) so its inverse z^o is also 
^o(i) = t (which proves equation (|3.4p ). The process £0 is the same as in Aq^ (p. 288). Let us 
note that Silvestrov's — > u is the same convergence we need, see |13l Definition 2.4.2]. 

□ 



Let W be a standard Wiener process and fj, E R. We denote a Wiener process with drift fi 

by 

(3.9) X t := W t + Aii, t > 0. 



Our first result is the following 



Lemma 19. Let T > and X be a Wiener process with drift given by \3. 9\) . We have 

(3.10) (x t -nt, TV(X,t)-^j ^ d (w t ,3- 1/2 B t y asc\0, 

where (W, B) are independent standard Wiener processes. The convergence is understood as 
weak convergence in C([0;T],R) 2 topology. 

Proof. We fix a, b G R and define A c t := aTV c (X, t) + bX t -{^ + bfj) t. Assume that we proved 
that 

(3.11) A c -> d (a 2 /3 + b 2 ) B, as c \ 0, 

weakly in topology of C([0;T],R), where B is some standard Brownian motion. The conver- 
gence for (a, b) = (1,0) yields that {TV C (A A , t) — ^} c>0 is tight, hence also is the sequence 
of vectors on the left side of equation (|3.10p . Now, applying the Crami'^cer-Wold device [TJ 
Theorem 7.7] we easily justify that the convergence of finite-dimensional distributions, hence 
(p3T0]) indeed holds. 

Now we are to prove (|3.1ip . We transparently transfer all quantities of Section [2] to the 
stochastic setting by applying them in a pathwise fashion, i.e. / (t) = Xf. We denote 

Gi(c) := (Mf -mi-c) + (Mf - m^ +1 - c) , Hi(c) := (Mf - m\ - c) - (M? - m c i+1 - c) . 

By Theorem 1161 and continuity of X we have TV C ^A A , Ty^j = Yli=oYi( c ) ( m ^ ac ^ this holds 
under additional assumption (|2.5p but this is irrelevant in the limit). By (|2.3p and again by 
Theorem 1161 we have 

k-l 

(3.12) WXT^-J^Hi^Kc, a.s. 

i=0 

(Note that Xq = 0.) We fix some a,b £R and for any i > write 

(3.13) Zi(c):=aGi(c) + bHi(c). 
We denote also 

(3.14) A(c) := n,i ~ T u,i-i> i > 1, and D (c) := T£ fi . 

The following simple observation will be crucial for the further proof. Let us notice that by 
the strong Markov property of X and its space homogeneity we have that {Zi(c)} i>l and 
{A(c)} , t>l are i.i.d. sequences. For i — the distributions are different because of "starting 
conditions". The first part, i.e. the values for i = disappear in the limit. For notational 
simplicity from now on, we will implicitly assume that i > 1. 

We will proceed now in the direction of utilizing Fact [THJ To do this, we need to calculate 
moments, fortunately enough |14] provides us with sufficient tools. Using the notation from 
|14| we may write 

(T C DA - T& ti , Mt-m\- c) = d (T c , X(T C ) + c), 



where T C ,X are defined in |141 Introduction]. Hence the formula |14[ (1.1)] reads as 

/o - ^ to / r*rr r \ n /W mr \n 5exp( — (« + u)c) exp(ac) 

(3.15) Eexp(a (Mf - - c) - /3 (T^ - 1^) ) - 



5 cosh(Jc) — (a + /x) sinh(<5c) ' 

where (5 = \/ fJ, 2 + 2/3. This formula is valid if a < 5coth(<5c) — /i, and /3 > 0. If /i 7^ we may 
also put /3 = 0. One may check that the pair 

is independent of (Tg f — Try i , M? — m| — c) . It becomes obvious when one recalls definitions 
of Section [2] ( (|2.6[) and (|2.7p in particular) and apply the strong Markov property of X. 



Moreover, we notice that the law of [T^ i+1 — Tfj i; M 4 C — m? +1 — cj is the same as the one of 

(T£, i — Ty iy Mf — mf — c) if we change the drift coefficient to — [i. Therefore, by |144 (1.1)] we 
get 

5 exp(— (a — /i)c) exp(ac) 



(3.16) Eexp(a (M? - m c i+1 - c) - /3 (T^ l+1 - T^)) 



<5cosh(<5c) — (a — /i) sinh(<5c) 



where 5 = \J + 2/3 (with the same restrictions as before). These are enough information to 
check the moment conditions required in Fact [18] Calculations are easy and straightforward 
however lengthy. We decided not to include all of them in the paper. Instead, we list crucial 
steps and provide the reader with the Mathematica notebook with all detail^. Combining the 
above equations and putting a = (note that this is always possible for c's small enough) we 
get 

Eexp(-/3A(c)) = 2/3 + f - - T . 

+ fJ, 2 + p cosh (2^2/3 + fi 2 ) 

Differentiation yields 

, s 2sinh(c/x) 2 9 „. 

3.17 EDi c = = 2c 2 + O c 4 . 

M 

One can check that the formula above is valid for \l = when we take the limit. This applies 
also to the subsequent moments formulae. Moreover 

EA(c) 2 = ^c 4 + 0(c 6 ), EA(c) 4 = ^c 8 + 0(c 10 ). 
3 11)5 

This is enough to check conditions (Al) of Fact 1181 as well as (A4) with 5 = 3. Analogously, 

by putting /3 = we calculate that 

4/i 2 

(3.18) Eexp(e^(c)) = ((ffl _ b) (1 _ ~ _ ^ ((fl + 6) (1 _ e2cAt) ^Tj • 

Again, by differentiation one gets 

ig EZ-(c) = 2sinh (cA*)(«cosh(c^) + ^sinh(c/x)) 

A* 



^http: //www, mimu w. edu.pl/~pmilos/moments .nb| . The file can be viewed with a free application available on 



http : //www. wolf ram. com/products/player/ 



And therefore 

(3.20) — ^ = + a coth(c/x)) = - + 6/x + O(c). 
Now we have 

Ecxp (aZ;(c) - 0Di(cj) = 

2(2/3 + M 2 ) 



-a 2 ct 2 + b 2 a 2 + 2fca/i + 2 (/3 + fi 2 ) + (a 2 a 2 + 2/9 - 6a(6a + 2ft)) cosh(2c v '20 + /i 2 ) - 2aa v / 27T+7t 2 sinh(2c V^TT" 2 ) 

Following axiom (A2) we denote Xi(c) := Zi(c) - (EZi(c)/EDi(c))A(c) 
Using this one may check that 

. 3a 2 - b 2 - Aabcfi + (a 2 + b 2 ) cosh(2c^) - 4a 2 c//coth(c/i) + 2ab sinh(2c/i) 

Now it is straightforward to check (A2) of Fact [18] viz. 
EXi(c) 2 1 



cshs(c/i) (3a — b — Aabcfi + (a + 6 ) cosh(2c/i) 

-4a 2 c/icoth(c/i) + 2a6sinh(2c/z) = + 6 2 ^ + ^afrc/z + 0(c 2 ). 



Finally, one can check that EAj(c) 4 < c 4 and hence (A3) is verified with 5 = 2. Having 
checked all conditions we conclude that for P c (t) defined by (|3.2|) and (j3. 13|) . (j3. 14[) we have 

,2 \ V2 



a . , \ . d / a 



P c (i) - {- + bfij * ( y + 6 ) B > as c \ °- 

Therefore in order to prove (|3.11|) it is enough to to show that P c (t) — aTV c (X, t) — bXt — > d 0. 
By the property equation (|3.12|) and the continuity of X it follows easily that it suffices to 
concentrate on the case (a, 6) = (1,0), that is At = TV C (X, t). Since Dq(c) has different 
distribution than £?j(c) for i > 1 we introduce two auxiliary objects 

{n > n 

«>0:^A(c)>t , 5 c (n)=^Zi(c), 
i=0 J i=0 

and 

P c (t) := 5 c (M c (t)). 

This differs slightly from P c , however, one easily checks that P c — P c — > d 0. By Theorem [16] 
we see that the processes TV C (X, t) and <S C ^M c (i)^ coincide at random times T^, i > 1 
moreover, both are increasing, hence, for any T > and e > 



sup 

te[o,T] 



TV c (A,i) - S c (M c (t)) > e < P sup Z^ (t) (c) > e . 



Using this we estimate 



P sup 
Ue[o,T] 



TV?(t)-S c [M e (t) 



> e 



< 



max Z k (c) > e + P [ M c (T) > — + 1 

fc<2T/E£>i( C )+l * W - / V ~ ( c ) 



2T 



The first term could be estimated by the Chebyshev inequality and the estimates of EZi(c) 
and EDi(c) 

EZi(c) 4 



max ^t(c) > e I < 

fc<2T/EDi(c)+l 



2T 

EDi(c) 



+ 1 



0, as c — > 0. 



The convergence of the second term to could be established by Fact | 



□ 



3.2. Proof for diffusions with a = const. We start with a yet simpler case. Namely, let W 
be a standard Brownian motion and X be a random variable. Let us define process Z by 

Z t := W t +Xt, t> 0. 
Lemma 20. Let T > 0. Let us assume that that W and X are independent then 



X,W,3~ 1/2 b) , asc\0 



X, W, TViZ, t) - - 
c 

where B is a standard Brownian motion and X, W, B are independent. The convergence is 
understood in weak sense in the product topology o/R x C([0;T],M) 2 . 



Proof. We will proceed by the very definition of the weak convergence. Let / : RxC([0; T], 
R be a bounded continuous function. We have 

lim E/ [X, W, TV C (Z, *)--]= lim EE f / ( x,W, TY C (Z, t) - - 

e\0 V C I c\0 V \ C 



Elim E ( f[x,W, TX C (Z, t) - - 
c\o \ \ c 



X = x 

') 

] = Ef (x,W,3~ 1/2 B 



□ 



= EE(^/ (x,W,3~ 1/2 B 
where we used Lemma [TU] and the Lebesgue dominated convergence theorem. 

We will deal now with diffusion given by an equation 
(3.21) dX t = dW t + v(X t )dt, X = 0, 

i.e. we set a = 1 in equation fjl -4[) . We assume also that [i is bounded and Lipschitz. This 
process is essentially a Brownian motion with "a variable drift". We denote 



(3.22) 



fjL* = SUp\fl(x)\ < +00. 



We will us the discretion technique. To this end we need to be able to control the increments 
of X. The following simple lemma is the first, most crude step of our analysis 



Lemma 21. Let t > and 5 > then for any b > we have 



Proof. We know that 



sup \X S -X t \>(n* + b)S < 2exp (-b 2 8/2) . 

sG[t;t+S\ J 



X t = X + W t + [ fi(X s )ds. 
Jo 



Hence, we have X s - X t G (W s -W t — fJ,*(s — t),W s — W t + fJL*(s — t)). Now the lemma follows 
by [IH Proposition II. 1.8]. □ 

Let us fix T > 0, n = 1, 2, ... and denote tf := G {0, 1, . . . , n}. We define the "approxi- 
mated" truncated variation process by 

[ntj-l 

(3.23) AT^ c (t):= TV c (X,[^;^ +1 ])+TV c (x,[^ j; t]), 

Its name is justified by 
Lemma 22. We have 

ATV n ' c (t) - TV{X, t) -»• 0, a.s. w;/ien c \ 0, 
and i/ie convergence is understood in C([0;T],M) topology. 

Proof. By ([2~TT]l one easily verifies that ATV n > c (t) < TY C (X, t). On the other hand, by (pTL2]) . 
TV C (X, £) - ATV n > c {t) < nc, for any t G [0; T]. □ 

We will take now a detour of the main flow of the proof in order to collect weak convergence 
facts used below. First we recall the Prokhorov metric. Let (S, d) be a metric space and V(S) 
be the space of Borel probability measures on S. We topologise 'P(S') with the Prokhorov 
metric 

(3.24) d P (P, Q) := inf {e > : P(F) < Q{F e ) + e, for all closed F C S} , 

in the above expression F e := {x G S : inf yg ^ d(x, y) < e} . It is well-known that when (S,d) 
is separable then convergence with respect to dp (•, •) is equivalent to weak convergence. We 
refer the reader to [21 Chapter 3] and Theorem 3.3.1] in particular. Given two random 
variables X, Y with values in the same space we will write 

dp (XX) :=d P (C(X),C(Y)), 

where C(X) denotes the law of X. 

In some parts of our analysis we will need the space of ci'^cedli^ceg functions 2?([0;T],K) 
introduced by (|3.3p . We will also use the following product space 

(3.25) C xV:= C([0;T],M) x £>([0;T],R), 
always with the norm given by \\(f,g)\\ := H/IL + Halloo- 



Lemma 23. Let (X, Y) be random variables with values inC xT>, moreover let A be an event. 
Then 

d P ((X,Y),(Xl A ,Y)) <2(1-P(A)). 

Proof. It is enough to apply [2j Theorem 3.1.2] with fj, = C((X, Y), (X1 A ,Y)). □ 

Lemma 24. Let X : = (Xi, X2) and Y := (Yi, Y 2 ) be a random variable with values inC xP 
such that 

P (11-^1 - Y i\\oo > e/2) < e/2 and P (||X 2 - Y 2 \\ OB > e/2) < e/2, 

^ (x,y) <e. 

Proof. We calculate 

PCIKXi-y^Jfa-ya)!! > e) < P(||Xi -y|| > e/2) +P(||X 2 -y 2 || >e)<e, 
now the proof follows directly by application of |2j Theorem 3.1.2]. □ 

We are ready to prove the main result of this part of the proof which is an upgrade of 
Lemma [19] to "simplified diffusions" given by equation (|3.2ip . 

Fact 25. Let T > 0. We have 

(3.26) (x, TV°(X, t) - ~J ^ d (X, 3~ 1/2 B), asc\0, 

where the convergence is understood as weak convergence in C([0;T],R d ) 2 topology and B is a 
Brownian motion independent of X. 

Proof. We recall that := -T, fix some A > fi* + 1 and define random sets 

A? := [X t n - A/n^Xtn + A/n 1 ' 4 }. 
We also define random variables 

and events 

ie{0,l,...,n-l} 

Using Lemma [21] we check that for n large enough we have P(£") > 1 - 2n exp(-n 1 / 2 /2)- 
Consequently, P (E n ) — >■ 1 as n — > +00. For n G N we define ci^cedii^ceg processes {X™} tg j . r j 
which approximate our diffusion: 

X™ := X t? + - t?) + W t - W t n, whenever t £ [t?; 

One easily checks that X n — > X a.s. with respect to || • H^. Let us recall (|3.23p . we define its 
counterpart for X n , viz., 

\nt\-l 

H^(t):= £ TV c (XM^;^ +1 ])+TV c (xM^ j; t]). 
i=0 

One checks (using the same method as in the proof of Lemma [22]) that 

H n ' c (t) - TV c (X n , t) -> 0, a.s. when c \ 0, 



norm || • H^. On each interval t S [ifi^^-i) we have 



(3.27) 




We observe that conditionally on E™ this expression defines a function of t which is Lipschitz 
with constant w n < Ln -1 / 4 for some L > 0. This follows by the fact that [i is a Lipschitz 
function itself. By (|2. 13|) applied with c\ = c and C2 = 0, conditionally on E™, we have that 

TV c pT\ [t» *]) - w n (t - i?) < TV C (X, [t?,t]) < TV C (X", + «/ n (t - t?), 

for any t € [t";i$+i]- Further 

(3.28) l E nH n ' c (t) - w n T < l E nATV n ' c (t) < l E nH n ' c (t) + w n T, 

for any te [0;T]. In other words: lllsnATF^^-lsniF^i)!!^ < 2w n T. Lemma |M1 implies 
that 

(3.29) dp ((X n ,l E nATV n > c ), (X n ,l E nH n ' c )) < Aw n T. 

It will be crucial that this estimate is uniform in c. Let us denote L n ' c := (H n ' c (t) — c/t). 
Lemma [20] applied term by term to H n ' c yields the functional convergence 

(3.30) (L n > c ,X n ) ^ d (3~ 1 / 2 B,X n ), asc\0, 

where B and X n are independent. In the above, we understand the convergence as the 
functional one in C X T) (see also ()3.25() ') 

The rest of the proof will follow by a metric-theoretic considerations. Let us denote 

X 1 (c) := (TV c (X,t)-t/c,X), X 2 (c,n) := (TV C (X, t) - t/c, X n ) , 

X 3 (c,n) := (ATV n > c (t) - t/c,X n ) , X 4 (c,n) := (l E n(ATV n > c (t) - t/c),X n ) , 
X 5 (c,n) := (l En (H n > c (t) - t/c),X n ) , X 6 (c,n) := (H n > c (t) - t/c,X n ) , 

X 7 (n) := (s^B, X n ) , X 8 := {^T^B, x) . 

Let us fix some e > 0. We find n\ 2 such that for any n > n\ 2 we have dp (X\{c), -^(c, nj) < e 
which is possible by Lemma [25] and convergence X n — > X We find 713,4 such that for any 
n > ^3,4 we have dp (X^(c, n), X^c, n)) < e which is possible by Lemma [23] and estima- 
tion of the probability of E n . Further we find 774,5 such that for any n > 774,5 we have 
dp (X4(c,n), Xz,(c,n)) < e which is given by equation (|3.29p . Next, we check that for any 
n > 773,4 we have dp (Xz,(c, n), Xq(c, n)) < e as well. Finally, we choose 777,8 such that 
for any n > 777,8 we have dp (Xf(n), X$) < e which holds by Lemma [25] We denote 
N = max(ni 5 2, ^3,4, ^4,5, n 7,s), obviously for this N all the above inequalities hold simulta- 
neously for any c > 0. 

Now we choose cq such that for any c < cq we have dp (^(c, N), X 3 (c, N)) < e and 
dp (Xq(c, N), X-/(c, N)) < e. The first one is possible by Lemma [22] and Lemma [24] and the 
second one by equation (|3.30p and again Lemma [25] Using the triangle inequality multiple 
times one obtains 

dp (Xi(c),Xs) < 8e, for any c < Co, 
This yields convergence equation (|3.26p since e was arbitrary. □ 



Remark 26. We strongly believe that it is not possible to improve the above proof to general 
diffusions. The main reason is that without a = const assumption equation equation (|3.2T[) 
is not longer true. Consequently, the estimate in equation (13. 29ft does not depend only on 
w n but also on c. Even worse, one can check that the estimate diverges to infinity as c \ 0. 
We could change n and c simultaneously in a smart way so that the estimate is still useful. 
However a new problem emerges then, namely estimate in Lemma [22] also depend on n and c. 
It appears that it is not possible to change n and c is such way that both estimates converge 
to when c \ 0. 

3.3. Proof for general diffusion. Now we proceed to the general case. Before proving 
Theorem [5] we present some measure-theoretic considerations. In the reasoning below by W 
we denote the Wiener measure on C([0;T],R), see e.g. |11] Proposition 1.3.3], and by H we 
denote the Cameron-Martin space, see |11] Definition VIII. 2.1]. Moreover by T~L we denote 
algebra (i.e. class closed under finite sums and finite intersections) generated by open balls 
with centers in H. We have 

Lemma 27. Let h : C([0;T],R) i-> R + be a measurable mapping such that f h(f)W(df) = 1. 
Then for any e > there exists m € N, sets A\,A2,... , A m € T~L and h\,h2, ■ ■ ■ , h m € M+ such 
that 

(3.31) J\he(f)-h(f)\W(df)<e, 
where 

m 

US) :=X>W/)- 

i=i 

Moreover, one may choose such A\,A2, ■ ■ ■ , A m that for all i < m, W(cL4j) = 0. 

Proof. In the proof we will write C instead of C([0; T], R) and B(f, r) will denote an open ball 
with convention B(f, 0) = 0. Let us notice that without loss of generality we can assume that 
h is bounded by some I > and has compact support, say contained in ball B(0,R). Indeed 
for any function h and any e > we can choose l,R such that J c |^(/)l{/ l <i}l{/es(o,il)} — 
/(/)|W(/) < e/2. Now it is enough to approximate /i(/)l{h</}l{/g_B(o,_R)} with accuracy e/2. 
Therefore from now on we will work implicitly with the assumptions listed above. 
Let us denote 

S k :={f€C:h(f)€(ke/2,(k + l)e/2]}, 
for k £ {0, 1, . . . , 2//e}. We note that by our assumption sets are bounded. We put 
S ■= e 2 /(4l 2 ). By the regularity of W (see [H Theorem 1.1.1]) we can find open sets Ok such 
that 

(3.32) S k cO k and W(O fc \ S k ) < 5/2. 

It is well known that C is a separable space and H is its dense subspace so one can easily 
find a countable subset {/i,/2, • • •} C H which is dense in C. For each /j we define r\ := 
sup{r : B(fi,r) C Ok} /2 (by convention we put r\ := if the set is empty). One promptly 



proves that O k = [J i B(fi,r^). By the continuity of measure there exists G N such that 

W(O fe )-W(U B(fi,rf))<5/2. 



i<i k 



Let us denote A k := Ui<i fc B{fii r i)- We now define 

We will now show that h e is a good approximating function. We recall that by the construction 
A k C O k and W(O k \A k ) < 6/2. This together with equation (f3T32]) yields that W(S k AA k ) < 
5, where A denotes the symmetric difference. We have 



f\he(f)-h(f)\W(df) 



J>e/2) l Ak (f)-J2Kms k (f) 



W(d/) 



<Y^ / \(ke/2)l Ak (f) - h(f)l Sk (f)\W(df) 
k Jc 



< | riS k ) + l^2 ^(S k AA k ) <i + l^5 = e. 

k k 

To check W(dA k ) = is is enough to prove that for any / G H and any r > we have 
W(dB(f,r)) = 0. By [lU Theorem VIII. 2. 2] it is enough to show that W(dB{0,r)) = 0. This 
holds by the fact that sup of the Wiener process has a continuous density (see |114 Section 
III.3]). □ 

Finally we present 

Proof, (of Theorem [5]) . We first will show that in order to prove equation (jl.5p it is enough 
to prove 

(3.33) (x, T\ C (X, t) - -> d (X, 2M) , as c \ 0, 

where M is the same as in Theorem [5j Since Xq = by equation (|2.3p there exists process 
{Rc(t)}te[0-T\ sucri that ||i? c ||(X) — c almost surely and 

(3.34) X t = UTV C (X, t) - DTV C (X, t) + R c (t). 
This together with equation (12.8P yields that 



(3.35) UTV C (X, t) = \ (TV C (X, t)+X t - R c (t)) , 

Therefore 

UTV c (X,i) - \ (VQ+Xt} = \ (rv c (x,t) - - l -R c (t). 

Now the convergence follows simply by fact that TV°(X, t) — {Xt) / c is a continuous transforma- 
tion of equation f|3.33[) and by [TJ Corollary 2, p. 31], [TJ Theorem 4.1]. A completely analogous 



argument proves the convergence of DTV C (X, t) — \ ((Xt)/c — Xt). The joint convergence in 
equation (jl.5p can be established in the same way. 

It will be more convenient to work with additional assumption that 

(3.36) Ci > a > C 2 > 0, 

for some constants C±,C2 > 0. At the end of the proof we will remove this assumption. 
Diffusion equation fjl -4[) writes in the integral form as 

-t r t 



X t = [ a(X s )dW s + f fi(X s )ds. 
Jo Jo 



Let us define fit : = J Q * a(X s ) 2 ds = (Xt), its inverse at : = inf {s > : f3 s > t} and 

X t :=X at , t G [0; T ] , where T := C 2 2 T. 

By the time-change formula |10[ Theorem 8.5.7] we obtain that X is also a diffusion fulfilling 
equation 



lo <T 2 {X S 

for some Brownian motion W. We chose such To that the definition is valid (i.e. ay < T). 
We note also that x t— > ^r^y is a Lipschitz function. Let us now denote the natural filtration 

of X (and W) by J-. Making the reverse change of time we get Xt = Xp t . We denote also 
Qt := J-p t . Now we can apply Fact 1251 We know that 

(3.37) (<JFV C (X, t),xj {B, X), 

where CTV C (X, t) := TV C (X, t) — § and B and X are independent. Let us also note that 
CTV C can be regarded as a measurable mapping CTV C : C([0;T],M) (->• C([0;T],R). 

Now, let K E ~H be non-empty set. We check that the measure P [X G -\X G K ) is abso- 



lutely continuous with respect to P ( X G -J . Indeed one needs only to check that P (X G if) > 

0. By the Radon-Nikodi'^oem theorem [H Theorem A. 1.3] there exists a measurable function 
h such that 

(3.38) Pfle d/|Jf G if) = fc(/)P G d/) . 

Using this fact we can leverage equation (|3.37p . Let us first note that by the portmanteau 
theorem [1] Theorem 1.2.1] and [1] Theorem 1.2.2] and standard topological considerations we 
know that equation (|3.37p is equivalent to 

(3.39) p({cTU c (A > )GKi}n{A > Gif 2 }) ^P(SgKi)p(a > gK 2 ), V^^ew- 



Further, by equation (|3.38p . we have 

a c : =p({ctu c (x) g Ai} n{ie A 2 j n{x g A}) = p(X g if) 



cru c (x) g Ai} n {x g a 2 } |x g a > 

= P (X G A) j h(f)l {CTV c( f)eKl }l {feK2 ^ X g d/ , 

where P ^X G dfj is the same as the Wiener measure. We now approximate h with accu- 
racy e = 1/n with simple function h n satisfying conditions of Lemma, [27] (we use additional 
superscript n to denote the case we are referring to). Hence we have 

(3.40) f \h n (f)-h(f)\r(xedf)<-, 

We define 

a n c := P (X G A) J h n (f)l {CT y c(f)eKl} l {feK2} F (x G d/) . 

One easily checks that for any c > there is \a c — a™| < 1/n. Applying equation f|3.39[) we 
obtain 

a n c = P (X G A) h?V (\CTV C (X) G K x \ n <jx G A? n A 2 }) 

> cV P (-B G ATi) P (X G A) J] /#P (l ei^n A 2 



i=l 



i=l 



= P (B G Ai) P (X G A) ^ /i n (/)P (x G d/) =: a n . 
It is easy to check that \a n — a\ < 1/n, where 

a := P (B G Ai) P (X G A) ^ /i(/)P (x £ d/j =P(B£ Ai) P ({x G A 2 } n {X G A}) . 
Using the standards arguments we obtain that for any K\, A 2 , A G % 

p (jcru c (x) g Ai} n ix g a 2 } n{i£ a}) 

^ cV) P (-B G Ax) P (X G A) ^ fc(/)P (1 £d/)=P(B6 A x ) P ({I G A 2 } n {X G A}) . 

Using pQ Theorem 1.2.2] in the same spirit as in the case of equation (|3.39p we get 

(CTV C (X),X,X) -> d (B,X,X), asc\0. 

where B is independent of (X, X) hence also of ft. Changing the time according to this process 
we obtain 

TV c (x,/3 t )-^ d 



This equation is well-defined as long as t < Tqq = T§jC\ = TC^/Cf. Our final step is to use 
(|2.9p in order to get 

TV c (X,t)-^±^ d B t . 
c 

So far we have obtained convergence in the space C([0; Tqq], R). Taking the initial value of T 
larger (which is possible as our diffusion is well defined on the whole line) we can obtain the 
convergence in C([0; T] , R). 

We are yet to remove assumption (|3.36p . For any N > we put 



a N (x) :- 



fi(x), if < N, 
fi(N), iix>N, 
jjl(-N), iix<-N. 



'a(x), if \x\ < N, 
a(N), ifx>N, n N {x):-- 
a{-N), ifx<-N, 
We define a family of diffusions by 

dX t N :=a N (X t N )dWt + »(X?)dt, X» = 0. 

We assume that this diffusion is driven by the same W as in (jl,4p and that X, X N are coupled 
in such a way that X? = X t and (X t N ) = (X t ) whenever t < t n := inf {t > : \X t N \ > N} = 
inf {t > : \Xt\ > N}. The solution of fjl -4j) is a continuous process and exists on the whole 
line, therefore for any T > we have 

^{t n <t} —>n^+oo 0, a.s. 

We notice now that X N fulfills (|3.36p . hence the thesis of Theorem [5] is already proved for it. 
The quantities studied in the proof are equal for X N and X on the set {r^ < T}. Using the 
metric-theoretic arguments as in the proof of Fact [25] one easily concludes the proof. □ 

4. Proof of Theorem [1] 

As indicated in Introduction the proof splits into two parts. In the first one we will prove 
Theorem Q] in the case when X is a Wiener process with a drift. This will serve as a key step 
for the second part of the proof in which, using time change techniques we will elevate the 
result to a general class of semimartingales. 

4.1. Proof for Wiener process with drift. This is much simpler compared to the proof 
of Lemma [19l therefore we provide only a sketch leaving details to the reader. Let X be a 
Wiener process with drift, i. e. 

X t := W t + fit, 

for a standard Wiener process W and /i£K. We have 

Lemma 28. Let T > and let {^}t g [o r] ^ e a Wiener process with drift. Then 

lime TV c (X,t) -> (X) t , a.s. 

The converge is understood in the C([0;T],R) topology. 

Proof. Firstly, we recall S c (n) defined in equation (|3.5|) and Zi{c) given by equation (|3. 13[) . 
We want to show that process Xf(c) := cS c (\g(c)t\) converges to a linear function. Let us 



consider 

n 

M n (c) :=cY,(Zi(c)-EZi(c)). 

i=l 

It is a centered martingale. Differentiation of equation f|3. 18[) yields that (we have a = 1, b = 
in this case) 

E(Z,(c) - EUc)f = 2cosh(2c M )sinhM 2 = ^ + 3) 

M 

Therefore, by the Doob inequality and equation (|3.17p we have 

E( sup [cS c (\g(c)t-]) - c\g(c)t-]EZ t (c))} 2 ) < LTc 2 , 

\te[0;T] J 
for some constant L. Using equation (13.19|) and equation (13.20|) one obtains 

X t (c) — > id, a.s., 

where id(t) = t and the converge holds in C([0;T],M) topology. Now one proves an analogous 
convergence for process V c (\g(c)t\), which is roughly speaking the inverse of M c . To finish the 
proof one needs to argue similarly as in the second part of the proof of Lemma [19j □ 



4.2. Proof for semimartingales. Now we assume that X t = Xq + M t + At, where M is a 
continuous local martingale and A is a process with bounded variation. 

To avoid notational inconveniences we assume that M, A are defined on [0; +oo) (one can 
simply put a constant process after T). Let us now introduce an additional standard Brownian 
motion (3 independent of X and denote 

X e :=X + e(3, e > 0. 

This is a simple trick to avoid the case when (X) is not strictly increasing. Indeed we have 
(Xf) = {Xt) + et. Obviously this is a strictly increasing function. Moreover its inverse, 
denoted by a, is almost surely Lipschitz with constant smaller then e _1 . Let us denote also 
M € := M + e/3. The DDS theorem [111 Theorem V.1.6] ensure that there exists a Brownian 
motion B such that 

Ml:=B {xt) , te[0;T\. 

Using (|2.9p we have 

(4.1) cTV c (X\t) = cTV c (B t + A at ,(X*) t ). 

Let us fix TV > 0. Applying (j2.14|) to the paths of B t + A at we get 

\TW c (B t + A at ,{X e t ) AN)- TV c (B t , (X^) AN)\< TY(A at , {X e )t A N) . 
Using Lemma [28] and the above estimate one gets that 

c TY c (B t + A at , {XI) A N) ^ cN0 {XI) A N a.s. 



where convergence is understood in C([0;T],R) topology. Moreover, the limit agrees with the 
limit of equation (|4.ip on the set {(X £ ) < N}. Hence we obtain 

cTV c (X\t) (X e ), a.s. 

Our aim now is to get rid of e. We fix some a G (0, 1) and notice that by (|2,13p we have 

c TV C (X, t)<c TV ac (X e ,t) + c TV (1 ~ a)c (eS, t) . 

Therefore we have 

limsupcTV c (X,t) < a~ 1 {X e t ) + (1 - a)~ 1 e 2 t = a~ l {X t ) + ((1 - a)" 1 + a" 1 ) e 2 t. 

By converging e — > and a — >• 1 one can obtain 

lim sup c TV C (X, t) < (X t ) , a.s. 

c\0 

Analogously one obtains a lower-bound for lim inf. Therefore we proved that for any t > we 
have 

limcTV c (X,t) = (X t ), a.s. 

c\0 

This is a one dimensional convergence but one easily extends it to the finite dimensional one. 
Moreover, since the trajectories are almost surely increasing the finite dimensional convergence 
can be upgraded to the functional one. This follows by the simple fact that if f n £ C([0; T], R) 
is a sequence of continuous increasing functions converging point-wise to a continuous function 
then the convergence is in fact uniform. 



In order to prove the convergence for UTV C it suffices to use equation (13. 35ft . DTV C follows 
similarly. 



5. Proof of large times results 

In this section we will only prove Theorem [12j It follows by a similar argument as in 
the proof of Lemma QjJJ This time c is fixed and n will go to infinity. The analogues of 
equation (13.13|) and equation (|3,14p are given by 

Zi{n) := n- l l 2 Yi{c). 

and 

A(n) := n~ l {T^ - T^), i > 1, and D (n) := n' 1 ^. 
By equation p.20p (with a = n 1 ^ 2 , b = 0) we have 

1^4 = n _1 / 2 ucoth(cu). 



We define Xi(n) := Zi{n) — (EZj(n) /MDi(n))Di(n) . Repeating calculations as in the proof of 
Lemma [T9l one obtains 



theorem it is enough to apply Fact 1181 This is an easy task. Above we already checked (Al), 
(A2) and (A3) are trivial. (A4) holds with any 5 > 0. 

We skip the proof of [11] which is a simpler version of the proof in Section 14.11 Proofs of 
Theorem [13] follows similarly to the one above with an exception that Zj (n) = M[ — mf — c 
in the case of UTV C and Z^n) = Mf - m c i+1 - c in the case of DTV C . 
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Vax(Xi(n)) 



3 + cosh(2c^) — 4c// coth(c/t) 



One checks that Var(JQ(n))/EA(ra) 



n/i 2 

(<r^) 2 as in Theorem 1121 Now in order to obtain this 
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